
    Adaptive DCTNet for Audio Signal Classification

    In this paper, we investigate DCTNet for audio signal classification. Its output feature is related to Cohen's class of time-frequency distributions. We introduce an adaptive DCTNet (A-DCTNet) for audio signal feature extraction. The A-DCTNet applies the idea of the constant-Q transform, with the center frequencies of its filterbanks geometrically spaced. The A-DCTNet adapts to different acoustic scales, and it captures low-frequency acoustic information, to which human audio perception is sensitive, better than features such as Mel-frequency spectral coefficients (MFSC). We use the features extracted by the A-DCTNet as input for classifiers. Experimental results show that the A-DCTNet combined with Recurrent Neural Networks (RNN) achieves a state-of-the-art bird song classification rate and improves artist identification accuracy on music data. These results demonstrate the A-DCTNet's applicability to signal processing problems. Comment: International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, United States, March 2017.
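
    The constant-Q idea mentioned in the abstract can be made concrete with a short sketch: filterbank center frequencies are spaced geometrically (a fixed number of bins per octave) rather than linearly, so low frequencies receive finer resolution. The code below only illustrates that spacing and a DCT-style projection onto it; the frequency range, bins-per-octave value and function names are assumptions, not the published A-DCTNet architecture.

    import numpy as np

    def geometric_center_frequencies(f_min=32.7, f_max=8000.0, bins_per_octave=12):
        # Geometrically spaced center frequencies, as in a constant-Q filterbank.
        # f_min, f_max and bins_per_octave are illustrative assumptions.
        n_bins = int(np.ceil(np.log2(f_max / f_min) * bins_per_octave))
        k = np.arange(n_bins)
        return f_min * 2.0 ** (k / bins_per_octave)

    def constant_q_dct_features(frame, sample_rate, centers):
        # Project one audio frame onto cosine atoms at the given center
        # frequencies; a sketch of the idea, not the paper's exact transform.
        t = np.arange(len(frame)) / sample_rate
        atoms = np.cos(2 * np.pi * centers[:, None] * t[None, :])
        return np.abs(atoms @ frame)  # one magnitude per band

    # Example: a 25 ms frame of noise at 16 kHz
    sr = 16000
    frame = np.random.randn(int(0.025 * sr))
    feats = constant_q_dct_features(frame, sr, geometric_center_frequencies(f_max=sr / 2))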

    Generating Images Instead of Retrieving Them : Relevance Feedback on Generative Adversarial Networks

    Finding images matching a user’s intention has been largely based on matching a representation of the user’s information needs with an existing collection of images. For example, using an example image or a written query to express the information need and retrieving images that share similarities with the query or example image. However, such an approach is limited to retrieving only images that already exist in the underlying collection. Here, we present a methodology for generating images matching the user intention instead of retrieving them. The methodology utilizes a relevance feedback loop between a user and generative adversarial neural networks (GANs). GANs can generate novel photorealistic images which are initially not present in the underlying collection, but are generated in response to user feedback. We report experiments (N=29) where participants generate images using four different domains and various search goals with textual and image targets. The results show that the generated images match the tasks and outperform images selected as baselines from a fixed image collection. Our results demonstrate that generating new information can be more useful for users than retrieving it from a collection of existing information. Peer reviewed.
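
    The relevance feedback loop between the user and the GAN can be sketched roughly as follows: show the user a batch of generated images, collect binary relevance marks, and move the latent codes toward the region the user liked before generating the next batch. The update rule, noise scale and generator interface below are generic assumptions for illustration; the abstract does not specify the authors' exact procedure.

    import numpy as np

    def relevance_feedback_round(generator, latents, relevant, noise=0.3, rng=None):
        # One round of relevance feedback over a GAN latent space (generic sketch;
        # the paper's actual update rule and GAN are not given in the abstract).
        # generator: callable mapping an (n, d) latent batch to images (assumed)
        # latents:   (n, d) latent codes of the images just shown to the user
        # relevant:  boolean array, True where the user marked an image relevant
        rng = rng or np.random.default_rng(0)
        if relevant.any():
            center = latents[relevant].mean(axis=0)   # move toward liked images
        else:
            center = np.zeros(latents.shape[1])       # no signal: re-explore broadly
        new_latents = center + noise * rng.standard_normal(latents.shape)
        return generator(new_latents), new_latents

    # Toy usage with a stand-in "generator" that just echoes its latent input
    z = np.random.randn(8, 512)
    images, z = relevance_feedback_round(lambda latent: latent, z,
                                         relevant=np.zeros(8, dtype=bool))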

    Communication-Efficient Stochastic Gradient MCMC for Neural Networks

    Learning probability distributions over the weights of neural networks has recently proven beneficial in many applications. Bayesian methods such as Stochastic Gradient Markov Chain Monte Carlo (SG-MCMC) offer an elegant framework to reason about model uncertainty in neural networks. However, these advantages usually come with a high computational cost. We propose accelerating SG-MCMC under the master-worker framework: workers asynchronously and in parallel share responsibility for gradient computations, while the master collects the final samples. To reduce communication overhead, two protocols (downpour and elastic) are developed to allow periodic interaction between the master and workers. We provide a theoretical analysis of the finite-time estimation consistency of posterior expectations, and establish connections to sample thinning. Our experiments on various neural networks demonstrate that the proposed algorithms can greatly reduce training time while achieving comparable (or better) test accuracy/log-likelihood levels, relative to traditional SG-MCMC. When applied to reinforcement learning, the approach naturally provides exploration for asynchronous policy optimization, with encouraging performance improvements.
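
    The communication pattern described above can be illustrated with a small single-process sketch: a worker runs local stochastic gradient Langevin dynamics (SGLD) steps and, every few iterations, pushes its accumulated change to the master and pulls the master's current state, in the spirit of the downpour protocol. The step size, sync interval and variable names are assumptions for illustration, not the paper's exact algorithm, and a real deployment would run many such workers asynchronously.

    import numpy as np

    def sgld_step(theta, grad_log_post, step_size, rng):
        # One stochastic gradient Langevin dynamics update:
        # theta <- theta + step_size * grad log p(theta | data) + N(0, 2 * step_size)
        noise = rng.normal(scale=np.sqrt(2 * step_size), size=theta.shape)
        return theta + step_size * grad_log_post(theta) + noise

    def downpour_worker(theta_master, grad_log_post, n_steps=100, sync_every=10,
                        step_size=1e-3, rng=None):
        # Downpour-style worker loop (illustrative names, not from the paper):
        # run local SGLD steps, periodically push the local change to the master
        # and pull the master's parameters back.
        rng = rng or np.random.default_rng(0)
        theta = theta_master.copy()
        samples = []
        for t in range(1, n_steps + 1):
            theta = sgld_step(theta, grad_log_post, step_size, rng)
            samples.append(theta.copy())               # local posterior samples
            if t % sync_every == 0:
                theta_master += theta - theta_master   # push local progress
                theta = theta_master.copy()            # pull master state
        return samples, theta_master

    # Toy posterior: standard normal, so grad log p(theta) = -theta
    samples, master = downpour_worker(np.zeros(2), lambda th: -th)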